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Haloarcula hispanica has been widely used in haloarchaeal studies, particularly in the isolation of haloviruses. The genome of 
strain N601, a laboratory derivative of the type strain ATCC 33960, was sequenced. Several potentially significant differences 
from the published sequence of the type strain (CGMCC 1.2049 = ATCC 33960) were observed. 
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Haloarcula hispanica is an extremely halophilic archaeon, orig- 
inally isolated from a solar saltern in Spain (1). Being easy to 
cultivate and genetically manipulate, this species is an attractive 
model archaeon and has been used in many studies (2-9), includ- 
ing the isolation of novel haloviruses (10-13). The genome se- 
quence of strain ATCC 33960, the type strain of H. hispanica, has 
been reported (14) and consists of two main chromosomes and 
one megaplasmid. Here we report the genome sequence of H. 
hispanica strain N601, a laboratory-derived mutant that arose 
during routine cultivation of strain ATCC 33960^. 

Haloarcula hispanica N601 was cultivated in 23% magnesium 
minimal medium (MGM) (15). Optical maps were prepared at 
Yourgene Bioscience, Inc. (New Taipei, Taiwan), and used to ori- 
entate contig sequences. DNA was extracted using the cetyltri- 
methylammonium bromide (CTAB) method (16) and sequenced 
at Mission Biotech, Inc. (Taipei, Taiwan), using a Roche GS FLX 
Titanium system. Using Newbler version 2.0.01.14 (Roche), we 
assembled 26 contigs from the pyrosequencing reads, and the op- 
tical maps allowed 14 of these contigs to be aligned with the major 
chromosome and 1 contig with the large plasmid. To sequence the 
second chromosome and the smaller plasmid, these replicons 
were separated by pulsed-field gel electrophoresis, the DNA was 
recovered by electroelution (17), and sequencing was performed 
using both the Roche GS Junior platform and the lUumina ge- 
nome analyzer II platform. lUumina reads were mapped to the 
major chromosome and large plasmid sequences. The remaining 
reads were assembled together with unused 454 contigs using the 
CLC Genomics Workbench 6 (CLC Bio) software. rRNA operons 
and regions that differed from the previously published sequence 
were PGR amplified with Phusion high-fidelity DNA polymerase 
(Thermo Scientific) and cloned and Sanger sequenced at Genom- 
ics BioSci & Tech, Inc. (New Taipei, Taiwan). 

The H. hispanica N601 genome is 3,901,495 bp in length, con- 
sisting of two chromosomes (chrl, 3,006,708 bp, and chrll, 
363,208 bp) and two plasmids (pHH406, 405,815 bp, and 
pHH126, 125,764 bp). The overall GC content is 62.43%. The 
genome contains 3,932 putative protein-coding genes, of which 



2,605 were assigned to clusters of orthologous groups (COG) fam- 
ilies and 658 were mapped to KEGG pathways (18). The coding 
density was 87.13%. There are 48 tRNA genes, 3 rRNA operons, 2 
noncoding RNAs, and 1 clustered regularly interspaced short pal- 
indromic repeat (CRISPR) array. Major differences detected be- 
tween the sequence of strain N60 1 and that of strain ATCC 33960^ 
(CGMCC 1.2049) include (i) the presence of 4 replicons (com- 
pared to 3), (ii) a prophage-like region in chrl (base position, 
22,557 to 34,957) not seen in ATCC 33960^, (iii) a region contain- 
ing two hypothetical protein-coding genes in chrl (base position, 
712,834 to 714,267) not seen in ATCC 33960T, and (iv) a trunca- 
tion event in chrl of strain N601 (loci HISP_06336 and 
HISP_06337). The genome sequence of strain N601, and particu- 
larly the differences from strain ATCC 33960^, may be useful in 
understanding the physiology, genome dynamics, and halovirus- 
host interactions within this species. 

Nucleotide sequence accession numbers. The genome se- 
quences of H. hispanica N601 and its annotation have been 
deposited at GenBank under the accession numbers CP006884 
(chrl), CP006885 (chrll), CP006886 (pHH126), and CP006887 
(pHH406). 
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